A Valid Candidate Approach to Mining Bi-Directional Traversal Patterns on the WWW
نویسندگان
چکیده
Mining traversal patterns is one of important topics in Web mining. It focuses on how to find the Web page sequences which are frequently browsed by users. In this paper, we propose two algorithms for mining traversal patterns. For the first algorithm, SpeedTracer*-I, it is a revised version of the SpeedTracer algorithm. It directly generates and counts all candidate patterns from user sessions. Moreover, it improves the checking step when candidate patterns are generated. Next, based on the SpeedTracer*-I algorithm, we propose the SpeedTracer*-II algorithm, which improves the performance of the SpeedTracer*-I algorithm by decreasing the times to scan the database. From the simulation results, we show that the SpeedTracer*-I algorithm needs less processing time than the SpeedTracer algorithm. Moreover, the SpeedTracer*-II algorithm needs less processing time than SpeedTracer*-I and Apriori-like algorithms (e.g., FS and FDLP algorithms). (keywords: association rules, data mining, traversal patterns, Web mining, WWW)
منابع مشابه
A Top-Down Algorithm for Mining Maximal Traversal Paths in Web Log Sessions
Mining of frequent traversal paths in web logs is an application of sequence mining and useful with many applications that include web recommendation, caching, pre-fetching etc. Most of the existing algorithms follow a bottom-up approach to mine sequence patterns in a database. In this paper, a fast top-down algorithm is presented to discover maximal traversal paths which are contiguous sequenc...
متن کاملDevelopment of Mathematical Model for Controlling the Drilling Parameters with a Screw Downhole Motor
Article presents results of study on possibility of increasing the efficiency of drilling directional straight sections of wells using screw downhole motors (SDM) with a combined method of drilling with rotation of drilling string (DS). Goal is to ensure steady-state operation of SDM with simultaneous rotation of DS by reducing the amplitude of oscillations with adjusting the parameters of dril...
متن کاملMining Sequential Patterns from Probabilistic Databases by Pattern-Growth
We propose a pattern-growth approach for mining sequential patterns from probabilistic databases. Our considered model of uncertainty is about the situations where there is uncertainty in associating an event with a source; and consider the problem of enumerating all sequences whose expected support satisfies a user-defined threshold θ. In an earlier work [Muzammal and Raman, PAKDD’11], adapted...
متن کامل2FACE: Bi-Directional Face Traversal for Efficient Geometric Routing
We propose bi-directional face traversal algorithm 2FACE to shorten the path the message takes to reach the destination in geometric routing. Our algorithm combines the practicality of the best singledirection traversal algorithms with the worst case message complexity of O(|E|), where E is the number of network edges. We apply 2FACE to a variety of geometric routing algorithms. Our simulation ...
متن کاملAn Effective System for Mining Web Log
The WWW provides a simple yet effective media for users to search, browse, and retrieve information in the Web. Web log mining is a promising tool to study user behaviors, which could further benefit web-site designers with better organization and services. Although there are many existing systems that can be used to analyze the traversal path of web-site visitors, their performance is still fa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006